Hyperdrive: A Multi-Chip Systolically Scalable Binary-Weight CNN Inference Engine
نویسندگان
چکیده
منابع مشابه
Hyperdrive: A Systolically Scalable Binary-Weight CNN Inference Engine for mW IoT End-Nodes
Deep neural networks have achieved impressive results in computer vision and machine learning. Unfortunately, state-of-the-art networks are extremely computeand memoryintensive which makes them unsuitable for mW-devices such as IoT end-nodes. Aggressive quantization of these networks dramatically reduces the computation and memory footprint. Binary-weight neural networks (BWNs) follow this tren...
متن کاملA CNN Universal Chip in CMOS Technology
This paper describes the design of a programmable Cellular Neural Network (CNN) chip, with added functionalities similar to those of the CNN Universal Machine. The prototype contains 1024 cells and has been designed in a 1.0μm, n-well CMOS technology. Careful selection of the topology and design parameters has resulted in a cell density of 31 cells/mm2 and around 7-8 bits accuracy in the weight...
متن کاملChipmunk: A Systolically Scalable 0.9 mm2, 3.08 Gop/s/mW @ 1.2 mW Accelerator for Near-Sensor Recurrent Neural Network Inference
Recurrent neural networks (RNNs) are state-of-the-art in voice awareness/understanding and speech recognition. Ondevice computation of RNNs on low-power mobile and wearable devices would be key to applications such as zero-latency voicebased human-machine interfaces. Here we present CHIPMUNK, a small (<1 mm) hardware accelerator for Long-Short Term Memory RNNs in UMC 65 nm technology capable to...
متن کاملArchitecture Design of a Scalable Single-Chip Multi-Processor
Now that system-on-chip technology is emerging, singlechip multi-processors are becoming feasible. A key problem of designing such systems is however the complexity of their interconnect and memory architecture [1]. An example of a single-chip multi-processor for real-time (embedded) systems is the Multi Micro Processor (M P). Its architecture consists of a scalable number of identical master p...
متن کاملStochastic Inference for Scalable Probabilistic Modeling of Binary Matrices
Fully observed large binary matrices appear in a wide variety of contexts. To model them, probabilistic matrix factorization (PMF) methods are an attractive solution. However, current batch algorithms for PMF can be inefficient because they need to analyze the entire data matrix before producing any parameter updates. We derive an efficient stochastic inference algorithm for PMF models of fully...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Journal on Emerging and Selected Topics in Circuits and Systems
سال: 2019
ISSN: 2156-3357,2156-3365
DOI: 10.1109/jetcas.2019.2905654